JPEG2000-matched MRC compression of compound documents
نویسندگان
چکیده
The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multilayer decomposition model for compound documents into two contone image layers and a binary mask layer for independent compression. While T.44 does not recommend any procedure for decomposition, it does specify a set of allowable layer codecs to be used after decomposition. While T.44 only allows older standardized codecs such as JPEG/JBIG/G3/G4, higher compression could be achieved if newer contone and bi-level compression standards such as JPEG2000/JBIG2 were used instead. In this paper, we present a MRC compound document codec using JPEG2000 as the image layer codec and a layer decomposition scheme matched to JPEG2000 for efficient compression. JBIG still codes the mask. Noise removal routines enable efficient coding of scanned documents along with electronic ones. Resolution scalable decoding features are also implemented. The segmentation mask obtained from layer decomposition, serves to separate text and other features.
منابع مشابه
Document Compression Using H.264/AVC
It has been verified that H.264/AVC, the newest video compression standard, can also be used to encode still images. In many cases, it outperforms state-of-art coders such as JPEG2000. For compound documents, the gains over JPEG2000 are even more expressive. In this scenario, the contributions of the present paper are distributed over four document encoding methods that use the H.264/AVC as a b...
متن کاملMRC Compression of Compound Documents using H.264/AVC-I
The Mixed Raster Content (MRC) ITU document compression standard (T.44) specifies a multi-layer multiresolution representation of a compound document. It is expected that higher compression can be achieved if more efficient compression standards are used to compress each layer. In this paper we present an MRC compound document codec that uses the H.264/AVC operating in INTRA mode to encode back...
متن کاملSegmentation and compression of documents with JPEG2000
We review the standard JPEG2000 for still image compression and mention some typical applications. Special weight is put onto the core coding system described in Part 1 and the compound image file format for document imaging described in Part 6 including a section on image segmentation. Index Terms — JPEG2000, still image compression, mixed raster graphics, segmentation
متن کاملCompression of Compound Documents
Compound (or mixed) document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites etc. Because of the very distinct nature of those two image classes (text/graphics vs. pictures), their compression invariably involves multiple compression systems and a region segmentation (classification) method. We rev...
متن کاملOptimizing Block-Threshold Segmentation for MRC Compression
Compound document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites, etc. We focus our attention on the mixed raster content (MRC) multi-layer approach for compound image compression. We study block thresholding as a mean to segment an image for MRC. An attempt is made to optimize the block threshold...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002